Displacement-Invariant Cost Computation for Stereo Matching
نویسندگان
چکیده
Abstract Although deep learning-based methods have dominated stereo matching leaderboards by yielding unprecedented disparity accuracy, their inference time is typically slow, i.e. , less than 4 FPS for a pair of 540p images. The main reason that the leading employ time-consuming 3D convolutions applied to 4D feature volume. A common way speed up computation downsample volume, but this loses high-frequency details. To overcome these challenges, we propose displacement-invariant cost module compute costs without needing Rather, are computed applying same 2D convolution network on each disparity-shifted map independently. Unlike previous convolution-based simply perform context mapping between inputs and maps, our proposed approach learns match features two We also an entropy-based refinement strategy refine map, which further improves avoiding need second right image. Extensive experiments standard datasets (SceneFlow, KITTI, ETH3D, Middlebury) demonstrate method achieves competitive accuracy with much time. On typical image sizes ( e.g. $$540\times 960$$ 540 × 960 ), processes over 100 desktop GPU, making suitable time-critical applications such as autonomous driving. show generalizes well unseen datasets, outperforming 4D-volumetric methods. will release source code ensure reproducibility.
منابع مشابه
Fast Cost Computation Using Binary Information for Illumination Invariant Stereo Matching
Stereo matching methods are used to estimate disparity values from captured stereo images. They exploit characteristics of binocular disparity for disparity estimation. Stereo images captured under practical conditions have different illumination status, which causes disparity errors in the matching operation. To solve this problem, we can use the adaptive normalized cross correlation (ANCC) as...
متن کاملBlur and Contrast Invariant Fast Stereo Matching
We propose a novel approach for estimating a depth-map from a pair of rectified stereo images degraded by blur and contrast change. At each location in image space, information is encoded with a new class of descriptors that are invariant to convolution with centrally symmetric PSF and to variations in contrast. The descriptors are based on local-phase quantization, they can be computed very ef...
متن کاملA method for learning matching errors for stereo computation
This paper describes a novel learning-based approach for improving the performance of stereo computation. It is based on the observation that whether the image matching scores lead to true or erroneous depth values is dependent on the original stereo images and the underlying scene structure. This function is learned from training data and is integrated into a depth estimation algorithm using t...
متن کاملA Method for Learning Matching Errors in Stereo Computation
This paper describes a novel learning-based approach for improving the performance of stereo computation. It is based on the observation that whether the image matching scores lead to true or erroneous depth values is dependent on the original stereo images and the underlying scene structure. This function is learned from training data and is integrated into a depth estimation algorithm using t...
متن کاملA Method for Learning Matching Errors in Stereo Computation
This paper describes a novel learning-based approach for improving the performance of stereo computation. It is based on the observation that whether the image matching scores lead to true or erroneous depth values is dependent on the original stereo images and the underlying scene structure. This function is learned from training data and is integrated into a depth estimation algorithm using t...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: International Journal of Computer Vision
سال: 2022
ISSN: ['0920-5691', '1573-1405']
DOI: https://doi.org/10.1007/s11263-022-01595-8